Automatic Detection of Thesaurus relations for Information Retrieval Applications
نویسنده
چکیده
Is it possible to discover semantic term relations useful for thesauri without any semantic information? Yes, it is. A recent approach for automatic thesaurus construction is based on explicit linguistic knowledge , i.e. a domain independent parser without any semantic component, and implicit linguistic knowledge contained in large amounts of real world texts. Such texts include implicitly the linguistic, especially semantic, knowledge that the authors needed for formulating their texts. This article explains how implicit semantic knowledge can be transformed to an explicit one. Evaluations of quality and performance of the approach are very encouraging.
منابع مشابه
Could We Automatically Reproduce Semantic Relations of an Information Retrieval Thesaurus?
A well constructed thesaurus is recognized as a valuable source of semantic information for various applications, especially for Information Retrieval. The main hindrances to using thesaurus-oriented approaches are the high complexity and cost of manual thesauri creation. This paper addresses the problem of automatic thesaurus construction, namely we study the quality of automatically extracted...
متن کاملSemantic relations in information science
This chapter examines the nature of semantic relations and their main applications in information science. The nature and types of semantic relations are discussed from the perspectives of linguistics and psychology. An overview of the semantic relations used in knowledge structures such as thesauri and ontologies are provided, as well as the main techniques used in the automatic extraction of ...
متن کاملModifiers of Conceptual Relations in Thesaurus for Automatic Conceptual Indexing
The paper describes representation and use of variable and nontypical relations of concepts in the Sociopolitical thesaurus constructed specially as a tool for automatic text processing. The variable and non-typical conceptual relations are described in the simplest way by marking of relations by modifiers. The simplicity of notation facilitates representation of such relations for large number...
متن کاملIdentifying Semantic Relations in Text for Information Retrieval and Information Extraction
Automatic identification of semantic relations in text is a difficult problem, but is important for many applications. It has been used for relation matching in information retrieval to retrieve documents that contain not only the concepts but also the relations between concepts specified in the user’s query. It is an integral part of information extraction—extracting from natural language text...
متن کاملAutomatic thesaurus construction
In this paper we introduce a novel method of automating thesauri using syntactically constrained distributional similarity. With respect to syntactically conditioned cooccurrences, most popular approaches to automatic thesaurus construction simply ignore the salience of grammatical relations and effectively merge them into one united ‘context’. We distinguish semantic differences of each syntac...
متن کامل